Spoken Language Identification with Phonotactics Methods on Minangkabau, Sundanese, and Javanese Languages
نویسندگان
چکیده
منابع مشابه
Methods for Spoken Language Identification
In this paper, we explore several machine learning techniques for classifying spoken language. In particular, we construct algorithms which utilize various spectral features derived from English and Mandarin Chinese phone call audio to predict the language to which the phone call belongs. We investigate multiple feature sets and modeling approaches, and find that Gaussian Mixture Models, combin...
متن کاملFatty acids intake among diverse ethnic groups in Indonesia
The use of dietary pattern specifically fatty acids intake should prove to be an informative and powerful means to augment our understanding of the role of diet in chronic disease particularly CHD. Cross sectional study was implemented to describe the nutrients intake specifically fatty acids intake of 4 (four) ethnic groups in Indonesia, such as Minangkabau, Sundanese, Javanese and Buginese. T...
متن کاملRecent progress in developing grapheme-based speech recognition for Indonesian ethnic languages: Javanese, Sundanese, Balinese and Bataks
With the advent of globalization, multilingualism in Indonesia gradually faces a state of catastrophe. Currently among 726 ethnic languages spoken in Indonesian archipelago, 146 are endangered. Several projects have been initiated for cultural preservation which can prevent the endangered language from being lost. Nevertheless, the available technology that could support communication within in...
متن کاملSpoken Language Identification Using Hybrid Feature Extraction Methods
This paper introduces and motivates the use of hybrid robust feature extraction technique for spoken language identification (LID) sys tem. The speech recognizers use a parametric form of a signal to get the most important distinguishable features of speech signal for recognition task. In this paper Mel-frequency cepstral coefficients (MFCC), Perceptual linear prediction coefficients (PLP) alon...
متن کاملEvaluation of language identification methods using 285 languages
Language identification is the task of giving a language label to a text. It is an important preprocessing step in many automatic systems operating with written text. In this paper, we present the evaluation of seven language identification methods that was done in tests between 285 languages with an out-of-domain test set. The evaluated methods are, furthermore, described using unified notatio...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Procedia Computer Science
سال: 2016
ISSN: 1877-0509
DOI: 10.1016/j.procs.2016.04.047